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Abstract 

The genotypes of 31 autosomal short tandem repeat loci in the population of Carloforte were analyzed, these repre- 
senting a linguistic and genetic isolate located on the island of Sardinia (Italy). The markers span the entire length of 
chromosomes 1 9, 20, 21 and 22. Allele frequencies and statistical parameters were presented for all loci. Observed 
heterozygosity ranged from 0.279 to 0.884, and polymorphism information content from 0.552 to 0.886. All but two 
loci showed Hardy-Weinberg equilibrium after Bonferroni correction. The 31 short tandem repeat loci examined in 
the present work provide additional data on the genetic structure of the Carloforte population. 
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Carloforte is the only village located on the small is- 
land of San Pietro, off the southwestern coast of Sardinia. 
The history of the Carloforte population dates back to the 
16 th century, when a number of families emigrated from 
Pegli, a small village in Liguria (Italy), now part of the city 
of Genova, to Tabarka, an island off the coast of Tunisia. 
During their time in Tabarka, a period of about two centu- 
ries, the Pegli community developed a successful business 
in tuna and coral fishing. At the beginning of the 18 th cen- 
tury, due to a decline in business and worsening relation- 
ship with the Bey of Tunis and Algiers, the community 
migrated to Sardinia and settled on the deserted island of 
San Pietro, where they founded Carloforte in 1738 (Valle- 
bona, 1974). For about 10 generations, these Genovese mi- 
grants had very little contact with the mainland populations 
of both Tunisia and Sardinia, maintaining a separate cul- 
tural as well as genetic identity. The cultural aspect is evi- 
dent in the Pegli dialect, which is still spoken nowadays, 
making the Carloforte population a linguistic isolate (Vona 
et ah, 1996). Earlier studies based on matrimonial struc- 
ture, classical genetic markers and the incidence of a spe- 
cific disease provided evidence that Carloforte is also a 
genetic isolate (Vona et ah, 1996; Heath et ah, 2001). In 
this paper, we present further data on the genetic structure 
of the Carloforte population by reporting on the distribution 
of 31 micro-satellite DNA loci. The markers employed 
were CA di-nucleotide repeats clustered in four linkage 
groups, representing chromosomes 19, 20, 21 and 22. 
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Historical documents list the names of the first set- 
tlers. Individuals selected for the present study were proven 
to be descendants of the village founders. Moreover, the 
participants were chosen for not having ancestors in com- 
mon, at least up to the grandparental generation. Both crite- 
ria were ascertained by a complete genealogical search 
through the detailed family information available from City 
Hall records (Siniscalco et ah, 1999; Heath et ah, 2001). 

About 10 mL of peripheral blood were collected from 
50 voluntary participants, in vacuum tubes containing 
EDTA as anticoagulant. Permanent cultures of EBV-trans- 
formed B-lymphocytes were prepared and stored anony- 
mously in the Coriell Repository. DNA was extracted by 
standard laboratory procedures. The study was approved by 
local ethics committees and all voluntary participants read 
and signed an informed consent form, in accordance with 
Declaration of Helsinki guidelines. 

Individuals were genotyped using the ABI Prism 
Linkage Mapping Panel, version 2. The markers were am- 
plified as single reactions and products pooled for analy- 
sis. About 50 ng of genomic DNA were amplified in a 
final PCR volume of 15 uL containing 1.5 uE lOx 
GeneAmp PCR buffer II, 1.5 uE 25 mM MgCl 2 , 1.5 uL 
2.5 mM dNTP mix, 1.0 uE Primer Mix and 0.12 uL 
AmpliTaq Gold DNA Polymerase, with an ABI 
GeneAmp PCR System 9700 Thermal Cycler. PCR reac- 
tions were amplified by using the following PCR condi- 
tions: a 12 min hold at 95 °C was followed by 10 cycles of 
1 5 s at 94 °C, 1 5 s at 55 °C, 30 s at 72 °C and 20 additional 
cycles of 15 s at 89 °C, 1 5 s at 55 °C and 30 s at 72 °C, with 
a final extension of 10 min at 72 °C. 
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The amplified PCR products were pooled at a ratio of 
5 uL of each FAM labeled product and 10 uL of each HEX 
and NED labeled one. D.I. water was added to a final vol- 
ume of 100 ul. The pooled PCR product was mixed with 
loading buffer at a 2:3 ratio for a 5 uL final volume (loading 
buffer contained 4.5 uL formamide, 1.0 uL blue dextran 
and 1.5 uL of S400HD-ROX as an internal size standard). 
The mixture was heated at 95 °C for 3 min and cooled on 
ice for 3 min, before being separated on an ABI 377 Se- 
quencer, using a 36 cm denaturing polyacrylamide gel for 
resolution of di-nucleotide repeat products. Separated al- 
lele fragments were analyzed using ABI GeneScan and 
Genotyper software and genotypes scored. 

Allele frequencies were estimated by gene counting. 
Both observed and expected heterozygosity were calcu- 
lated with the GENETIX program (v. 4.0). Deviation from 
Hardy- Weinberg equilibrium (HWE) was tested by the 
Markov chain, using the GENEPOP program (v. 4.0). Sta- 
tistical parameters of population genetics and forensic in- 
terest, namely polymorphism information content (PIC), 
power of discrimination (PD), and power of exclusion 
(PE), were calculated using Power Stats package v. 1.2 
(Promega Corporation, USA). 

To verify whether the population had incurred a re- 
cent bottleneck, data were analysed using the 
BOTTLENECK program, v 1.2 (Cornuet and Luikart, 
1996). This program allows for the evaluation of observed 
and expected heterozygosity, and determines statistical dif- 
ference based on equilibrium mutation/genetic drift. This 
was obtained through simulating the coalescent process of 
n genes under the two-phase mutation model (TPM), which 
was proved to better fit micro-satellite analysis (Di Rienzo 
et al, 1994). The TPM mainly consists of one-step muta- 
tions, with a small percentage (5%-10%) of multi-step 
changes. 

Allelic frequencies of the 3 1 STR loci tested in the 
population of Carloforte are reported in Table SI. 
Hardy- Weinberg equilibrium (HWE) and population pa- 
rameters are shown in Table 1 . 

The 31 loci showed a high degree of heterozygosity, 
varying from 27.9% for marker D21S1256 to 88.4% for 
marker D22S283. The exact test based on Markov chain 
analysis revealed that 6 out of 3 1 markers did not meet 
Hardy- Weinberg equilibrium, whereas after Bonferroni 
correction, only two of these, D20S173 and D21S1256, 
showed an excess of observed homozygotes. The high de- 
gree of isolation of the Carloforte population, together with 
the reported deviation from Hardy Weinberg expectation, 
prompted us to verify the possibility of a recent bottleneck 
in Carloforte through the BOTTLENECK program. Under 
the assumption of the two-phase model of mutation (TMP), 
the Wilcoxon test revealed non-significant heterozygosity 
(p = 0.459), thereby indicating that, following initial colo- 
nization events, there were no further contractions in the 
size of the Carloforte population, this being in agreement 



Table 1 - Heterozygosity, number of alleles detected and HWE evaluation 
(indicated as probability values) of 31 investigated markers in the 
Carloforte population. 
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with historical records. Moreover, the mode shape test 
showed an L-shaped distribution, which is also consistent 
with the absence of a recent bottleneck (data not shown). 

The high level of heterozygosity in a genetic isolate 
like Carloforte, with a long history of endogamous and con- 
sanguineous marriages, may at first be surprising. We be- 
lieve that the sampling strategy accounts for the findings: 
differences among individuals were maximized by select- 
ing participants with no ancestor in common, at least up to 
the grandparental generation. This could be very important 
in association studies, where it is necessary to analyze large 
numbers of individuals in order to detect statistically signif- 
icant deviations in allelic distribution between cases and 
matched controls. In a previous study, on applying the sam- 
pling strategy as described above, we genotyped 55 indi- 
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viduals from the Carloforte population at 5 unlinked 
micro-satellite loci in order to obtain an accurate descrip- 
tion of their genomic profile. We showed that allele fre- 
quencies at all loci were practically the same down to a 
subset of 20 individuals (Siniscalco et ah, 1999). There- 
fore, once a breeding unit is identified through the record- 
ing of marriage patterns over the last five generations, a 
small sampling of contemporary descendants is still repre- 
sentative of its founder group, provided that the individuals 
selected had their ancestors all born in the same breeding 
unit, but unrelated for, at least, the last two generations. An 
obvious consequence of the proposed sampling strategy is a 
reduction in the cost of association studies, due to a lower 
number of individuals having to be genotyped. 

Finally, Table 2 shows parameters of forensic inter- 
est. The combined power of both discrimination and exclu- 



Table 2 - Statistical relevant data for 31 investigated markers in the 
Carloforte population. MP: matching probability, PD: power of discrimi- 
nation, PIC: polymorphism information content, PE: power of exclusion. 



Locus 


MP 


PD 


PIC 


PE 


\J 1 7k>Z 1 o 


0 162 


0 838 


0 616 


0 314 


U 1 7iJOO L r 


0.049 


0 95 1 


0 852 


0.402 


i j i y ozzu 


0 053 


0.947 


0 810 


0 654 


D19S414 


0 112 


0.888 


0 723 


0.510 


DI 9^770 

1 J i y >j£.z.\J 


0 081 


0 919 


0 774 


0.510 


HI 9S420 


0.120 


0.880 


0.714 


0.472 


m 9^907 


0 092 


0 908 


0 735 


0.617 


D19S571 


0.099 


0.901 


0.768 


0.490 


D19S418 


0.146 


0.854 


0.669 


0.472 


D19S210 


0.143 


0.857 


0.666 


0.343 


D20S889 


0.055 


0.945 


0.809 


0.543 


D20S115 


0.204 


0.796 


0.577 


0.308 


D20S186 


0.069 


0.931 


0.820 


0.722 


D20S112 


0.101 


0.899 


0.724 


0.451 


D20S195 


0.060 


0.940 


0.826 


0.468 


D20S107 


0.063 


0.937 


0.806 


0.677 


D20S119 


0.100 


0.900 


0.726 


0.425 


D20S178 


0.055 


0.945 


0.833 


0.751 


D20S196 


0.062 


0.938 


0.795 


0.500 


D20S100 


0.094 


0.906 


0.740 


0.425 


D20S173 


0.259 


0.741 


0.552 


0.070 


D21S1256 


0.158 


0.842 


0.692 


0.055 


D21S1914 


0.057 


0.943 


0.849 


0.313 


D21S1252 


0.062 


0.938 


0.822 


0.722 


D21S266 


0.095 


0.905 


0.705 


0.425 


D22S420 


0.122 


0.878 


0.690 


0.279 


D22S315 


0.095 


0.905 


0.714 


0.490 


D22S280 


0.097 


0.903 


0.773 


0.670 


D22S283 


0.035 


0.965 


0.886 


0.762 


D22S423 


0.063 


0.937 


0.801 


0.722 


D22S274 


0.120 


0.880 


0.739 


0.716 



sion were absolutely discriminating, their values being, 
respectively, 7.7 x 10" 33 and 0.999999999918. It has been 
reported that genetic isolation could reduce the capacity for 
paternity exclusion, without significantly affecting the 
power of discrimination (de Pancorbo et al, 2000). This 
could be a critical issue in forensics, for example when set- 
ting up a database from a small village or population. In our 
case, the power of exclusion was still very high, again prob- 
ably as a result of the sampling strategy. The combined 
matching probability for the 31 STRs was 
0.9999999999923. Therefore, the 3 1 STR loci examined in 
the present work, specifically designed for population ge- 
netics studies, also turned out to be suitable for general fo- 
rensic applications. 
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Supplementary Material 

The following online material is available for this ar- 
ticle: 

Table S 1 : Allele frequencies of the 3 1 STR loci in the 
population of Carloforte 



This material is available as part of the online article 
from http://www.scielo.br/gmb. 
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